Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences Supplementary Material
نویسندگان
چکیده
The sim2 data set consists of simulated sequencing reads from the human chromosome 1. The sequencing parameters as well as underlying TPM values for the 15,677 transcripts in one of the two simulated conditions were estimated using RSEM v1.2.21 [6] from the ERS326990 sample from the ArrayExpress data set with accession number E MTAB 1733. We simulated three biological replicates from each of two conditions. Three types of differential abundance were introduced between the two conditions, all assigned to genes with estimated TPMs between 0.1 and 1000 in the reference data set (the “eligible” genes):
منابع مشابه
Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences [version 2; referees: 2 approved]
High-throughput sequencing of cDNA (RNA-seq) is used extensively to characterize the transcriptome of cells. Many transcriptomic studies aim at comparing either abundance levels or the transcriptome composition between given conditions, and as a first step, the sequencing reads must be used as the basis for abundance quantification of transcriptomic features of interest, such as genes or transc...
متن کاملDifferential analyses for RNA-seq: transcript-level estimates improve gene-level inferences
High-throughput sequencing of cDNA (RNA-seq) is used extensively to characterize the transcriptome of cells. Many transcriptomic studies aim at comparing either abundance levels or the transcriptome composition between given conditions, and as a first step, the sequencing reads must be used as the basis for abundance quantification of transcriptomic features of interest, such as genes or transc...
متن کاملEmpirical Bayes analysis of RNA-seq data for detection of gene expression heterosis.
An important type of heterosis, known as hybrid vigor, refers to the enhancements in the phenotype of hybrid progeny relative to their inbred parents. Although hybrid vigor is extensively utilized in agriculture, its molecular basis is still largely unknown. In an effort to understand phenotypic heterosis at the molecular level, researchers are measuring transcript abundance levels of thousands...
متن کاملPolyester: simulating RNA-seq datasets with differential transcript expression
MOTIVATION Statistical methods development for differential expression analysis of RNA sequencing (RNA-seq) requires software tools to assess accuracy and error rate control. Since true differential expression status is often unknown in experimental datasets, artificially constructed datasets must be utilized, either by generating costly spike-in experiments or by simulating RNA-seq data. RES...
متن کاملGene-level differential analysis at transcript-level resolution
Gene-level differential expression analysis based on RNA-Seq is more robust, powerful and biologically actionable than transcript-level differential analysis. However aggregation of transcript counts prior to analysis results can mask transcript-level dynamics. We demonstrate that aggregating the results of transcript-level analysis allow for gene-level analysis with transcript-level resolution...
متن کامل